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Abstract. We introduce a new model for the generation of random satisfiability 
problems. It is an extension of the hyper-SAT model of Ricci-Tersenghi, Weigt 
and Zecchina, which is a variant of the famous A-SAT model: it is extended to 
g-state variables and relates to a different choice of the statistical ensemble. 

The model has an exactly solvable statistic: the critical exponents and scaling 
functions of the SAT/UNSAT transition are calculable at zero temperature, with 
no need of replicas, also with exact finite-size corrections. 

We also introduce an exact duality of the model, and show an analogy of 
thermodynamic properties with the Random Energy Model of disordered spin 
systems theory. Relations with Error-Correcting Codes are also discussed. 
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1. Introduction 

1.1. Motivations 

Complexity theory concerns the classification of combinatorial optimization problems, 
according to the computational cost required for their solution |l], ^, |j| . 

A central role is occupied by the class of hard problems named NP (Non- 
deterministic Polynomial time). For problems in this class, a potential solution can 
be checked in polynomial time, whereas finding one solution may require, in the worst 
case, a time growing exponentially with system size. 

Often, the crude worst-case analysis of ordinary complexity theory hides the 
mechanism which leads to NP complexity. A statistical mechanical analysis of the 
ensemble of possible instances of the problem (we could say a "typical case" analysis, 
eventually dependent on some macroscopic parameter of the theory) would have the 
ambitious challenge of deepening the understanding of computational complexity, and 
improving the algorithm design for real world applications. 

For example, in a variety of cases, the problem admits a space of parameters, and 
a certain order parameter exists, showing a critical behaviour. In some of these cases, 
a widespread conjecture is that the computationally hard instances appear with a 
significant probability only when generated near "phase boundaries" of the ensemble, 
with respect to this phase transition. 

An exemplar case of this phenomenon is the random Boolean satisfiability problem 
(iv-SAT (§ §,||). In this case, a pecular transition, called SAT/UNSAT transition, 
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occurs: when the density of clauses is higher than a certain critical value, the system 
is highly constrained, and the average probability of having a solution goes to zero 
in the thermodynamic limit. Otherwise, the system is slightly constrained, and the 
average probability of having a solution goes to one in the thermodynamic limit. 
Hard-to-solve instances can be found only in a narrow region on the density axes, in 
correspondence of the critical density. This fact has a clear interpretation: heuristic 
algorithms for randomly searching a solution are efficient when the density is low 
enough, while algorithms for randomly searching a contradiction are efficient when 
the density is high enough. At the phase boundary, however, both the average 
probability of being satisfable or unsatisfiable are order 1, then heuristic (incomplete) 
search algorithms have no way to disentangle, in a given finite time, the unsatisfiable 
instances, from those which are simply very hard to solve. So, in this region there 
appears an exponential critical slowing down which makes the search inefficient for 
any practical purpose. 

A sub-class of the NP class exists, called NP-complete, such that, for each 
problem in this sub-class, no other NP problem is more than a polynomial factor 
harder. Informally, a problem is NP-complete if answers can be verified quickly (NP 
requirement), and a quick algorithm to solve this problem can be used to solve all 
other NP problems quickly (completeness requirement). NP-complete problems are 
particularly important, as, in some sense, they manifest all the potential difficulties of 
any NP problem, and are a sort of "language" for properly describing every possible 
problem in NP class. The iT-SAT problem is a NP-complete problem for K > 3. 
For its simplicity in formulation, and versatily in encoding other NP problems, it has 
acquired a paradigmatic role for the study of NP complexity. It has been studied 
both directly (see e.g. ]?], |), or, for a review, Q and references therein), or by the 
introduction of more "handable" variants which reproduces some of the peculiarity 
of K-SAT problems. One step in this direction is the introduction of the hyper-SAT 
model by F. Ricci-Tersenghi, M. Weight and R. Zecchina (see || |l0|]). 

1.2. Presentation of the material 

In this paper, we shall discuss a very simple and exactly solvable model for the 
generation of random combinatorial problems. The model has deep analogies with 
the hyper-SAT model, and inherits from it some important characteristics: 

• it is hard to handle by local search methods in the whole region of the parameter 
space; 

• the problem may be solved in polynomial time by a simple global method (Gauss 
reduction) and therefore belongs to the class P; 

• in a variant (unfrustrated model) one solution may be superimposed by 
construction. 

The most important feature of our model is the fact that also the statistical properties 
of the model are exactly known, at finite and infinite size, thus yielding to a wide 
amount of analytical results. For example, we obtain the exact expression for the 
probability distribution of the rank of a M x N random matrix in GF(q), result which 
was previously known only in the limit N — > oo at constant M — N. This result 
contains all the information about the average "satisfiability" probability, and much 
more. The methods used in the paper can be extended in order to calculate other 
interesting quantities. 
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Furthermore, the simple underlying geometry of the model allow for a clarifying 
graph-theoretical interpretation of the problem, which leads to the formulation of an 
exact duality of the system. 

The paper is organized as follows: 

(i) In this introductory section we shortly describe some problems related to our 
model: random linear systems, the K-SAT model and the hyper-SAT model. 

(ii) In the second section we discuss of some related fields of application, mainly Error 
Correcting Codes Theory and Hypergraphs Theory. 

(iii) In the third section we introduce the model, and describe its main properties, 
with particular attention to the duality and relation deriving from it. 

(iv) In the fourth section we exactly calculate the free energy distribution function at 
zero temperature, solving a realization of the "classical moment problem" . 

(v) In the fifth section we rederive the results of the previous section, by mean of a 
direct analysis of the Gauss algorithm which allows for a depiction of the structure 
underlying the ensemble of instances. 

(vi) In the sixth section we derive the conclusions of our work, and anticipate the 
directions of our forthcoming research. 

1.3. A prolog: systems of random linear equations in GF(q) 

We will start with a small prolog of elementary linear algebra, as it will provide a nice 
setting for our model. 

Systems of linear equations, beyond the ordinary theory on R or C, can be studied 
on more generic structures. For example, in the textbook JHJ, following the original 
work of Kovalenko and collaborators |[2| [Hi), a certain number of questions about 
systems of linear equations on the Galois field GF(q) is answered. 

The Galois field GF(q) 7 with q a prime number, is a finite field whose elements 
are the integers in Z g (i.e. the integers from to q — 1), and sum and product are 
defined modulo q: 



The prototype of a linear system Ax = v, or of a homogeneous linear system 
Ax = 0, in GF(q), when the above definitions are assumed, maintains the same 
formal structure. It also preserves most of the fundamental results of linear algebra, 
like the existence of a Gauss algorithm for triangulating a matrix, or the fact that the 
rank plus the dimension of the kernel is equal to the number of rows of the matrix. 

Statistical mechanics plays a role when we consider averages over a certain 
ensemble of matrices and vectors, in a sort of "thermodynamic limit" on their 
dimension. 

For example, given two integers N and M, and a prime number q, a good 
statistical ensemble is the set of the couples (^4, v), with A an M x N matrix, and v an 
M -dimensional vector, with elements randomly chosen in Z g . A statistical mechanic 
would calculate, for a certain fixed value of 7 = N/M, in the limit N — > 00, the 
average satisfiability of this problem, i.e., for a random "extraction" of an instance 
(A, v) from the ensemble, which is the probability that the linear system in GF(q), 



(xi + x 2 ) G F( q ) = X\ + x 2 (mod g) ; 
(xi ■ x 2 )GF(q) = xi ■ x 2 (mod q) ; 



(xi + x 2 ) GF ( q) E Z q ; 

(xi ■ x 2 ) GF ( q) e 1 q . 



(la) 
(lb) 
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Ax = v, has at least one solution. In more detail, we could calculate which is the 
probability distribution for the dimension of the kernel of the linear application A, in 
the limit of infinite A and M, with N — M fixed. 

This last point is exactly the main question answered by the Russian school in 
Ipl, which will be rede rived as a preliminary result in this paper. 



1.4-- Nomenclature 

In the framework of statistical mechanics, it is frequent to work with spin variables 
a = ±1, while in the framework of random satisfiability, it is customary to work 
with Boolean variables b G {TRUE, FALSE}. We will often switch between the two 
formulations, with the table of correspondences 



b 


a 


TRUE 
FALSE 


1 

-1 



So, we will consider a set of A spin variables Oj, with j = 1, . . . , A, or, alternatively, 
a set of A Boolean variables bj. Each of the 2 N choices of the variables, denoted by 
(j, or by b, will be called a configuration. 

A random satisfiability model is defined in terms of an ensemble of random 
instances, that is a collection of M = A/7 Boolean conditions on the variables (called 
clauses), randomly chosen from a certain set. This set of the possible clauses will 
primarily distinguish the different models, so it will be discussed further, case by case. 

Given a certain instance, if we have at least one configuration which 
simultaneously satisfies all the clauses|, the instance is a satisfiable (SAT) one, else it 
is an unsatisfiable (UNSAT) one. 

We will introduce a fictitious Hamiltonian for each instance. Our choice is to 
assign to a configuration an energy equal to the number of unsatisfied clauses. The 
low temperature limit of the partition function calculated with this Hamiltonian is 
the number of solutions for a certain instance. 

We will adopt the graphical convention for the logical operators A (and), V (or) 
and V (xor). 



1.5. The K -SAT model 

In this model, given an integer K, the set of clauses {C} is the set of all the A-uples 
of variable indices n±, . . . , uk, with a Boolean assignment Sj attached to each index. 
An instance in the ensemble is a subset X of this set of clauses, with cardinality M, 
and with no repetitions admitted. 

A certain configuration will be a solution if 

bsolut. <^> f\ I \/ (sfv6 n w) ] =TRUE. (3) 
Cei \j<K 3 J 

(i) 

If we call crj the spin variables associated to the Boolean variables bj, and r,- the 
spin value associated to the Boolean assignments Sj , both following the table (||) , a 

I It will be "one non-trivial configuration" in the model with a superimposed solution. 
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possible spin Hamiltonian which counts the unsatisfied clauses is 

^=isnMV)- (4) 

Ci€lj=l 

Interpreted as a spin model Hamiltonian, the system has random diluted quenched 
couplings, with a complicated pattern of interactions: for each if-uple of variables, 
if we have a if-spin coupling between them, then we have also couplings between all 
the subsets of the K variables, with a choice of signs causing the 2 K levels of the 
if- variables 1-clause subsystem to split in a 2 K — 1 multiplet of energy —1, and one 
level of energy 2 K — 1. 



1.6. The hyper- SAT model 

This model, introduced by Ricci-Tersenghi, Weigt and Zecchina in J9| (see also ifjof ) 
as a variant of the K-SAT with a more clear spin structure, is defined as follows. 

Given an integer if, the set of clauses {C} is the set of all the if-uples of variable 
indices ni, . . . ,tik, with a single Boolean assignment s attached. An instance in the 
ensemble is a subset X of this new set of clauses, with cardinality M, and no repetitions 
admitted. 

A certain configuration will be a solution if 



b solut. 




= TRUE . 



(5) 



If we call tW the spin value associated to the Boolean assignments sW, again following 
the table (0), a possible spin Hamiltonian which counts the unsatisfied clauses is 



CGI 



(6) 



This ensemble of Hamiltonians has the form of a diluted mean field if -spin disordered 
system, that is a spin model with TV variables and M quenched if -spin couplings, 
randomly chosen between all the if-uples of variables, and with values randomly 
chosen in { — 1, 1}. 

A variant of the problem, named "unfrustrated" model, is the restriction of the 
clauses set only to the clauses with s = TRUE (that is, a ferromagnetic diluted if -spin 
system). In this case we will have the obvious "ferromagnetic" solution bj = TRUE 
(or G.- L = +1) for each instance. 



1.7. The hyper-SAT model as a random linear systems 

The hyper-SAT model can be reinterpreted in the framework of random linear systems 
we introduced in the prolog. Given a hyper-SAT instance with N variables and M 
clauses, we can introduce an M x N matrix J y - in the field GF(2), such that Jy = I if 
the variable bj appears in the i-th clause, and Jij = elsewhere. With the expanded 
table of correspondences 



b 


a 


X 


TRUE 
FALSE 


1 

-1 



1 



(7) 
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we have the composition correspondences 

6iV6 2 = (T1CT2 = (xi + x 2 )gf(2) ; 

so, for each clause d, 

v ^=n- 

So, at the end, we can analyze the problem in the more handable form Jx = v 
(frustrated model), or Jx = (unfrustrated model). 

The shift to K-sp'm models will be given by the correspondence (0), so the 
Hamiltonian in the spin formulation will be 

«j,vW = 5E( 1 -(- i rn°-/ y )- ( g ) 

while in the linear system formulation will be 

w JlV (x) = E ( x - ME J * x i -«*))• ( 9 ) 

with #2 = 1 if 7i is even, and if n is odd. As in the following we will generalize 
the system to (/-state variables, we will introduce a S q function with the dehnition: 

«->={; t^t ■ <«» 

Beyond the correspondence between a specific hyper-SAT instance and a specific 
random linear system, we remark that the choice of the ensemble is crucial for the 
statistical properties of the system. The statistical ensemble of the hyper-SAT is very 
different to the random ensemble we suggested as an example in the prolog, as in the 
hyper-SAT we have exactly K non zero terms in each row of J. 

2. Random satisfiability, graph theory and Random Linear Codes. 

2.1. Linear codes in information theory 

A main problem in information theory consists in signal reconstruction when a 
sequence of bits is transmitted through a "noisy" channel, that is a channel where 
transmission errors can occur with a certain probability. Just to fix the ideas, we 
will concentrate on the so called symmetric noise channel, in which each bit has a 
probability (1 — p) to be transmitted unaltered, and a probability p to be transmitted 
as its negate: 





1 




f 


1 -p p 
p 1-p 



When p — no error can occur, and transmission is for granted. When p = 1/2 
the output signal is totally decorrelated from the input signal, and no inference is 



GF(2) 
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possible even in principle. In the intermediate cases, the task is to use the information 
transmitted through the channel, in the form of the partial correlation between input 
and output signals, to obtain the safe transmission of a shorter signal. 

This problem can be easily generalized to a generic "alphabet" , with characters 
in 7L q (i.e., instead of having bits in {0, 1}, we have variables in {0, 1, . . . , q — 1}). A 
generalization of the symmetric noise channel defined above could be as follows: each 
character has a probability (1 — p) to be transmitted unaltered, and a probability 
to be converted in each of the other characters. For example, when q = 3 we would 
have 








1 


2 





l-p 


p/2 


p/2 


1 


p/2 


l-p 


p/2 


2 


p/2 


p/2 


l-p 



In this case, the totally decorrelated case occurs for p — . 

A code is an algorithm which fulfills the task above, replacing the interesting 
signal with a redundant but error-robust encoding of it. The ratio between the length 
of the original and of the encoded signals is called the rate of the code. A critical rate 
exists such that, in the limit of infinite length of the message, the decoding procedure 
almost always succeeds in reconstructing the original signal from the noise-distorted 
transmitted stream. In most cases, the probability of success as a function of the 
rate jumps to one in an abrupt way, typical of critical phenomena (reconstructable- 
unreconstructable transition) . 

The quality of a code is mainly given by two parameters: the critical rate at 
given error probability and the computational complexity class of the encoding and 
decoding algorithms. 

A family of codes, called linear codes MM, is closely related to random satisfiability 



problems and certain spin glass models in statistical mechanics [15 , and a deep insight 
into the reconstructable-unreconstructable transition can be derived from the study 
of the phase diagrams of the related spin glass models |l(| . A well-known example of 
linear code, called Varshamov's linear code, or RLC (Random Linear Code) |l7], |l8| 
has been the first discovered code to reach the Gilbert- Varshamov bound on the rate 
of the code. The Varshamov's code is closely related to our model. 

Suppose we have a random linear system Jx = v in GF(q), with M equations, 
in N > M variables, and suppose the kernel of the linear application J has the 
minimum possible dimension L — N — M. This linear system can be used for the 
error correcting transmission of a message of length L, on an alphabet of q characters. 
First we introduce some definitions: 

(i) The possible original messages are the vectors y of the linear space Z g L . 

(ii) The encoding map £(Z q L — > 7h q N }, which encodes the original L-bits message in 
the redundant A-bits codeword, is defined with the aid of the linear application 
J. If we choose a basis for the kernel 

ker(J) = span(s (1) , . . .s (L) ) , (11) 

the map will be defined by 

L 

£ : y->x(y)=^2/iS (i) . (12) 
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(iii) A distance on the codewords space Z q N will be introduced: 

N 

d(x,x') = ^[l-<5(x i -^)] . (13) 

3=1 

Then, for the transmission of a certain message, we follow the procedure: 

(i) The original message is a vector y mos £ Z, 1 '. 

(ii) We use the map £ for the encoding. The original codeword is x mcs = £ (y me s)- 

(iii) The codeword is transmitted through the noisy channel. The transmitted 
codeword will be a vector x trans . The probability distribution for the transmitted 
codeword depends on the kind of noise we consider. In the case of symmetric 
noise we have 

pW = (1 - rt "((T^T)) ' (14) 

(iv) for the reconstruction of the signal could be used, for example, the Minimum 
Distance Decoding criterium, that is, if we call x roc the minimum of the 
function <i(x, x trans ) in the image of the encoding map £ (Z g L ), we choose as 
the reconstructed message y rcc the vector y roc = £ _1 (x rcc ). 

The interesting question is, in the "thermodynamic" limit of long signals N — > oo, 
with the rate R = L/N fixed, which is the probability that the reconstructed message 
y rec is identical to the original message y mC s, as a function of the noise parameter p 
and of the rate R. 

A statistical study |l(| shows the tricky fact that this probability tends to one 
if R < R c (p), and it tends to zero if R > R c (p), and that the critical line R c (p) is 
exactly the Shannon bound for the capacity of a symmetric noise channel |fj~i| . [l8| . 



2.2. Random graphs and statistics 

Another mathematical framework in which this kind of satisfiability problems is well 
depicted is hypergraph theory. In these sections we will shortly introduce some 
notation which will be useful in the following, first starting with ordinary graph theory, 
then jumping to hypergraphs (for an introductory bibliography, see |19| , |2(fl). 
Given 

V = {pi, ■ ■ ■ ,p N } 

a finite set of cardinality N, called the set of "vertices", we define the set of all the 
non-ordered couples of vertices in V 

£ = {(PnuPna)} n l ^ n 2 

as the set of "edges". For each subset E C £ of cardinality M, the datum of the set 
of vertices and of this set is called the graph T(V, E), with N vertices and M edges. 

Given a vertex pi, its coordination (or degree) is defined as the number of edges 
Ej G E such that pi £ Ej. A path between the vertices p a and pb is a sequence of 
vertices and edges in E 



(Pa,E 1 ,p 1 ,E 2 ,P2, ■ ■ ■ ,E L ,p b ) 
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such that pi G Ei,E i+ i; the number of edges L is called its length. A graph is said 
connected if, for each couple of vertices, exists a path which joins them. 

A non-trivial closed path is usually called a loop. More generically, we will adopt 
the "physical" convention of using the term loop for any subset of the set of edges 
El C E such that the graph, restricted to this set T(V,El), has only vertices of even 
(eventually zero) coordination. 

In the limit of large N and M, we can introduce an ensemble of graphs, giving 
a probability measure over the possible sets of edges, and we can calculate statistical 
averages of interesting quantities. For example, an ensemble could be the one in which 
only connected graphs with M edges are admitted, and no repetitions of edges are 
allowed. 

2.3. Hypergraphs and K -SAT problems 

A generalization of graph theory concerns objects in which the set of possible edges £ 
is replaced by the set of all non-ordered K-uples of vertices (typically K > 3), called 
hyper- edges: 

£k = {(PnuPm, ■ ■ ■ ,Pn K )} n i n 2 + ■ ■ ■ + n K . 

The datum of the set of vertices and of a subset of admitted hyper-edges will be called 
a hypergraph. 

The notions of coordination, path, connectivity, and loop, as we defined them in 
the previous section, immediately generalize to this case. In particular, the notion of 
hyper-loop, as a subset El C E such that the relative hypergraph T(V,El) has only 
vertices of even coordination, will be crucial in the following. 

The ensemble of connected hypergraphs with M hyper-edges, with repetitions 
allowed, is the proper graphical scheme for describing the topological features of 
ordinary K-SAT problems, and also of the hyper-SAT, although, as we already 
mentioned in the previous section, we should also imagine some Boolean assignments 
attached to the hyper-edges (one for each vertex of each hyper-edge in the case of 
K-SAT, just one per hyper-edge in the case of hyper-SAT, and no assignments at all 
in the case of unfrustrated hyper-SAT|). 

Another possible variant could be the one in which hyper-edges with any number 
of vertices arc admitted 

N 

£y= (J £k ■ 

K=0 

The ensemble of all possible hypergraphs of M hyper-edges in £y, with homogeneous 
measure, and no repetitions allowed, is the one which describes the model we will 
define in the next chapter. 

2-4- Group structure of the hyper-SAT and the role of hyperloops in satisfiability 

In the hyper-SAT model the configurations shows a group structure. The group 
operation is the sum term-by-term in the linear system formulation, or the product 
term-by-term in the spin formulation, that is 

(mod 2); a^=afaf. (15) 

§ This immediate correspondence with the hypergraph interpretation is the reason for the name. 
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In the unfrustrated model, the set of solutions is a subgroup Z 2 S , with g generators, 
of the configuration group. 

The fact that in the unfrustrated hyper-SAT the set of solutions is a subgroup is 
an immediate consequence of the fact that the kernel of a linear application in GF(2) 
is a subspace Z 2 9 of the vector space Z 2 N ■ 

Let's go back to the formulation of a hyper-SAT instance in terms of a system of 
linear equations in GF(2): 

K 

£ X n W = V i ■ 

3 = 1 3 

We can graphically identify each variable with a vertex, and each clause with a hyper- 
edge joining the vertices which appears in it. 

Each subset of equations in the system can be identified with the sub-hypergraph 
(in the following just hypergraph) of the original one, obtained by restriction to the 
correspondent hyper-edges. The sum of the equations in the subset will be called the 
equation associated to the hypergraph. 

Note that also hypergraphs have a group structure Z 2 M : given two hypergraphs, 
the group operation is the disjoint union of the two hypergraphs, from the graphical 
point of view, and the sum of the two corresponding equations, from the algebraic 
point of view. The identity is the empty set. 

The set of hyperloops is a subgroup Z 2 9 of the hypergraph group. When the 
subgraph is a hyper loop, the associated equation is a relation involving no variables: 
in the unfrustrated case, it is just a trivial identity; in the frustrated case, it is 
a configuration-independent characteristic of the instance which could eventually 
determine an a priori unsatisfiability. Because of randomness in the choice of the 
for the instance, this will happen with 1/2 probability for each linearly independent 
hyperloop. The number of linearly independent elements in a group Z 2 9 is just g' . 

So the sentence in which explains the role of hyperloops in the frustrated 
model: 

"As soon as one hyperloop arises in the hypergraph, half the formulas become 
unsatisfiable..." 

can also be interpreted as "As soon the average number of generators for the hyperloop 
group, g' ', increases of one unity, the average number of solutions would decrease of a 
factor one half (if we were working with the frustrated model), so the average number 
of generators of the group of solutions, g, decreases of one unity" . 

We can see that an intriguing symmetry of roles between solutions and hyperloops 
seems to arise. This symmetry of roles is more clear in the linear system formulation, 
as we have that a homogeneous solution is a solution of Jx = 0, while a hyperloop is a 
solution of J T y = 0. In our model, the analytical solvability will provide a clarifying 
reinterpretation of these heuristic arguments. 

3. The model 

One difficulty on an exact analytic approach to the hyper-SAT is that the ensemble 
of formulas is not symmetric: for an admitted clause, the number of 1 in each row 
must be exactly K, while there is not any similar constraint on the columns. This 



An exactly solvable random satisfiability problem 



11 



fact introduces difficult-to-handle correlations on the values of the entries in the grid 
formulation. 

In this section we introduce a model defined on a larger ensemble, which cancels 
correlations between the entry of the matrix, and fully exploits the symmetry of roles 



between solutions and hyperloops we suggested in section 2.4 



3.1. Description of the model 

The model we are going to define is a random satisfiability problem. So it will be 
defined by an ensemble of instances, with a probability measure on it (the possible 
realizations of the problem, with associated the probability that such a realization 
occurs), an ensemble of configurations (the potential solutions), and a notion of 
satisfiability. 

Given the integer parameter q, which must be a prime number, and the "noise" 
parameter p G [0, 1] (as it will have a role similar to the one of the noise parameter of 



Error Correcting Codes, see section 2.1), the ensemble of the instances is the product 
set of the set of all the M x N matrices J^ , filled with integers in Z 9 , each matrix 
being equiprobable, and the M-dimensional vector space 1^ q M , where the probability 
for a vector v to be extracted is dumped accordingly to the number of its non-zero 
entries. More precisely 

"«v) = S -™.(i-P)"(^TTy) 

where we used the short notation of equation (|l^) ; the parameter p has been chosen 
to reproduce the average fraction of non-zero elements in v. The parameter 7 = N/M 
will determine the average solvability of the problem: large values of 7 corresponds 
to weakly constrained systems, and a typical instance will be solvable, while, on 
the contrary, small values of 7 correspond to highly constrained systems, and a 
typical instance will be unsolvable (or will have only the superimposed solution in 
the unfrustrated case, see below). 

The ensemble of the configurations is the ./V-dimensional vector space X q N . A 
certain configuration x is said to be a solution if it solves the linear system 

Jx = v . 



The particular case of p — is called unfrustrated, as we have a superimposed 
"ferromagnetic" solution x for each instance J. When p / we will deal with the 
frustrated version. When p reaches the value p = (q — l)/q, the model will be called 
totally frustrated, as the probability measure is flat also for the inhomogeneous terms 
v. 

A particular realization of the problem (instance) will be said satisfiable (SAT) if 
we have at least one solution, in the frustrated model, and at least one more solution, 
after the superimposed one, in the unfrustrated model. Elsewhere, the instance will 
be said to be unsatisfiable (UNSAT). 

Although in this part we only study the average solvability of the problem, it is 
customary of a statistical mechanical approach to introduce a fictitious Hamiltonian, 
such that the solutions correspond to the states with zero energy, and all the other 
configurations have a rigorously positive energy, study the thermodynamic of this new 
system, and recover the original model in the zero temperature limit. 
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Obviously, there is a large arbitrarity in the choice of the Hamiltonian, and we 
will analyze various possibilities in the third part of the paper. Anyway, just to fix the 
ideas, we will concentrate on the most simple choice, the one in which the Hamiltonian 
just "counts" the number of violated clauses: 



M / N 



= E y - s *{12 J v x i ~ v *)} > ( l6 ) 

A more familiar spin reformulation is 



n 



At I N \ 

/(*)=££ U-^n-f - 1 ) • ( i7 ) 
i=i \ j=i j 



where the correspondence is 



e i 



In the homogeneous case every S4 = 1. 

This model shares with the Random Energy Model |2l], |22|, ^3], ^5), the 
property that the energy of different states are (almost always) uncorrelated. More 
precisely, it is true that, if (E,Ei, . . . ,E k ) are the energy respectively of the states 
(a, , . . . , er( fc )), calculated with a certain instance, for each couple of functions /(x), 
g(xi, . . . , Xk) we have 



f(E)g(E u ... ,E k ) = f(E)-g(E 1 ,...,E k ) 

if o span(cr^ 1 \ . . . , crW), and the fraction of states for which this last condition is 
verified is about 1 — q~ N+k . 



3.2. The underlying group structure 



As we anticipated in section 



2.4 



a nice feature of our model, which is common 



also to the hyper-SAT model and to the random linear system problem, is that the 
configurations shows a G„ = Z q N group structure. 

The group operation in the linear system formulation is the sum term-by-term of 
the configuration entries: 

x { ; b) = xf ] + xf ] (mod?);. (18) 

In the unfrustrated model, the set of solutions is a subgroup Gs = Z 9 9 , with g 
generators, of the configuration group. 

In the frustrated model, given an instance (J, v), we define the associated 
homogeneous problem the one in which we search for solutions of the homogeneous 
system Jx = (that is, the problem in which we turn all the Boolean assignments 
S(j) to TRUE) As we said above, the set of solution of this homogeneous problem is a 
subgroup Gs — Z 9 9 . If we have at least one solution x for the original problem, the 
set of solutions of the original problem is the lateral class x + Gs ■ 

These statements have a clarifying correspondence in the random linear system 
on GF{q) framework: the space of admitted configurations x is the discrete torus X q N ', 



An exactly solvable random satisfiability problem 



13 



with, as a group operation, the sum of discrete vectors quotiented by periodicity. The 
fact that in the unfrustrated hyper-SAT the set of solutions is a subgroup, and in the 
frustrated case it is a lateral class, is in correspondence with the fact that the set of 
solutions of a linear system on GF(q) is a subspace 1 q of the vector space 1 q N in 
the homogeneous case, and an affine subspace in the inhomogeneous case. 

The model naturally shows also a gauge invariance. As the Hamiltonian is of the 

form 

Hj, v (x) = W(Jx - v) = W(w) , 

it will remain unchanged for transformations which keep unchanged the vector w, that 
are the changes of basis and the translations on the torus of configurations "L q N ' . The 
more general gauge transformation Q of this kind is 



-> x' = £(x + 
G{B,0- { J -> J' = JB- 1 (19) 
-► v' = v + J£ 



Where B G GL (N, GF(q)) (change of basis) and £ G Z 9 (translation). 

This gauge structure is purely a consequences of the vectorial form of the 
combination w = Jx — v. We can derive a larger gauge group from the peculiar 
form of the Hamiltonian 



M 

W(w)=£(l -*,(«„)) . 

i=l 

From this form we deduce that vectors w 1; w 2 , differing only for a permutation of 
the elements, and for the product of each entry for a non null value, have the same 
energy. The group {1, 2, . . . , q — 1}, with the product of GF(q) (q a prime number), 
is called Z*. We have M copies acting independently on the M entries of w. Then 
we have the permutation group of M elements, &m- The group whose elements are 
a permutation of the M entries, followed by the product of each entry for an element 
of Z* , is the group[| 

(z*) k e M 

We call q—Perm [M, GF(q)) the set of M x M matrices which is the M-dimensional 
representation of this group (that is, the matrices with exactly one non zero element 
in each row and in each column. Note that the normal subgroup is the subgroup of 
diagonal matrices). So the most general gauge transformation Q is 

r x -> x' = B(x + 
g(S,B,£):\ J -> J^SJB- 1 (20) 

Where S G q—Perm(M,GF(q)) (permutation and product on the entries of w), 
B E GL (N, GF(q)) (change of basis) and £ G 7L q N (translation). 

In the case of the unfrustrated model, in which we impose the vector v to be the 
null vector, we will restrict to £ = gauge transformations. 

|| The symbol K means that (Z*)® M is a normal subgroup of the whole group 
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A common tool when dealing with systems with quenched disorder is the replica 
trick. This method leads to calculate annealed averages over a replicated system, that 
is, if the number of replicas is m, the space of configurations is the (tensor) product 
of m copies of the original one, and the Hamiltonian of the replicated system 7^( m ) is 
the sum of the Hamiltonians calculated on the different configurations of each replica: 

m 

H( m) (x 1 ,..,x m ) = ^H J (x ). (21) 
0=1 

We are interested to the "annealed" partition function of this system, that is the sum 
of the Gibbs factor e~ l3H -' ( X1 >---' x ">) on all the configurations (xi,... ,x m ) and all 
the instances J. 

In this frame, and in our peculiar case, a new symmetry arises between replicas 
and clauses, which enlarges the gauge group. If we call 

and defining 

N 

^Vi a — ^ JijJja 3 

i=i 

we have for the Hamiltonian the symmetric expression 

m M 

« w w=EE( 1 -w ■ ( 22 ) 

From this expression we see that we could permutate the mM elements of the matrix 
W in an arbitrary way, and multiply each of them for an independent element in 
1q . Such a general class of transformations has not a clear action on the physical 
variables (J, J). We could restrict ourselves to the action of a permutation matrix for 
the clauses S G q—Perm(M,GF(q)) on the left, and a permutation matrix for the 
replicas <S" G q—Perm (m, GF(q)) on the right. So we have that, as a consequence of 

H {m) {W) =H {m) {SWS') 

the general gauge transformation is of the form Q: 

{jT j/T _ q t t R -i 

j Z J'Jbjs (23) 

Where S G q—Perm(M,GF(q)) (permutation and product on the clauses), B G 
GL (N, GF(q)) (change of basis on the variables space) and 5" G q—Perm (m, GF(q)) 
(permutation and product on the replicas). 



3.3. The clause-variable (C-V) duality 

For the unfrustrated model we can state a notion of duality which formalizes the 
heuristic arguments of section [j.4| on the symmetry of roles between clauses and 
variables. We will call this duality the C-V duality. 
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The C-V duality maps a M x N instance of the problem in a N x M instance. 
The matrix J' associated to the new instance is simply the transposed of the original 
one J' = J T . As the duality brings the density parameter 7 in 7 _1 , it has a "fixed 
point" in this parameter space at -f c = 1. We will see that this density is also the 
critical density for the SAT-UNSAT transition. 

In a graphical representation the dual of a generic configuration is a generic 
hypergraph, and the dual of a solution is a hyperloop. 

More generally, we can consider all the degeneracy numbers of the energy levels 
in the two problems 

sW^WJ.n); ir( m )=2<5(W JT (r),m); (24) 

which will define a sort of two dual partition functions associated to each instance 

Z(J) =^S* ( " ) e- /3n ; Z {T \J) = J2 H(m)e ~ >3m ■ (25) 

n m 

A first easy statement deriving from duality could be 

Z{J T ) = Z^\.J)- Z^(J T ) = Z(J). (26) 

But the duality provides much stronger relations between arbitrary functions in the 
operators and H {m ^. 

.f(5 (ll) (J),^ (m) (^)) = f{H^ n \j T ),S^ m \j T )) . (27) 
And then, averaging over the ensemble of instances: 

f(S^,H(™)) N M = ./•(#(«), S(™)) M Af finite-size; (28) 

/(£("), #( m )) = /(i/("),S'( m )) 7 _ 1 infinite size limit. (29) 

In the following we will often omit the superscript in and H^°\ referring to them 
simply as S and H. 

3.4-. Expectation values of S and H moments 

In what follows we will give elementary combinatorial derivations of the expectation 
values for a certain set of polynomials in S and H. 

As the formal definition of the number of solutions S is 

5(j) = ^<y(w J ((7),o) ) 

(7 

a nice feature of this operator is that averages over a and over J easily commute, so 
that 

S = 2 S(Hj(a),0) = ]T S(n.j(a), 0) . (30) 

a a 

We will call A4(<? 9 ) the number of subsets of order k of linearly independent elements 
in a group Z q 9 . This quantity can be deduced from a simple inductive argument: if we 
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already have k — 1 linearly independent elements, we can choose the fc-th one between 
the q 9 elements of the whole Z q 9 , unless it is already one of the q k ~ 1 elements of the 
Z g fe_1 generated subgroup, so we must have 

W) = A/U(<f) • (q 9 - q k -') • 

For the case k = 0, we have only the empty set, so Afo(q 9 ) — 1. This implies: 

M(q 9 ) = i[(q 9 -q 1 )- (31) 

i<k 

We will introduce a short notation for the recurrent products: 

b 

K a , b = n o- 

so that Mk{q 9 ) = q k9 1Z g -k, g - The expressions [^] are known in literature as "Gauss 
polynomials" ; in this specific case they are polynomials in the variable g _1 (see 
Appendix A ). 

A nice feature of our ensemble is that a change of basis doesn't affect the 
probability distribution of the ensemble of J grids, as it is flat over all the possible 
grids. 

Given a certain set of k linearly independent configurations {cr' 1 ', . . . , cr^ k '}, we 
can calculate the probability that all of them are solutions of a given instance, averaged 
over the instances. 

Using the stability of the ensemble under change of basis, we could pass to a 
new basis in which the first k elements are the crW, . . . ,a^ k \ As they are linearly 
independent, this change of basis is not singular. It is easily seen that they are all 
solutions of the instance if and only if the first k rows are filled with zeroes, so we 
have 

II W^o) - (q- M ) k ■ (33) 

i=l,— ,k 

Collecting equations (|3C|j3l| , |33l ) we get 

n(s-<r)=A4(s)= ^ n^P^o) 

i<k {o-C 1 ),... , CT < fc )} i 

- e n^(- w )'°) (34) 

{(T* 1 ),... ,(T< fc )} i 

= N k {q N )q- kM =q k( - N - M) 1l N ^ N . 
The dual of equation (|34|) is: 

\{(H-t) = Jf^H) = Af h (q M ) q- hN = q-^ N - M ^n M -h,M ■ (35) 

i<h 

A more complex question we can answer is the probability that k linearly independent 
configurations {cr*- 1 -*, . . . are solutions of a certain instance J, and h linearly 



n 
m 



/v n— ? 

no.-. 



11, 



0,1! 



(32) 
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independent configurations {r^, . . . , t^' 1 '} are solutions of its dual instance J T , at 
the same time. With a similar "double" change of basis we find that both the first k 
rows and the first h columns should be filled with zeroes, so 



[] S(Hj(aW) 7 0) H 5{UjT(rU)),Q) = a kh - kM - hN . (36) 

i=l,"- ,fc j— l,--- .h 

If we collect equations (p0,pl],p6h we obtain that 



i<k j<h (37) 



kh (k-h)(N-M) n-, 
q q K-N-kM l<-M-h 



M 



Notice that lZN-k,N and Ttu-hM are exponentially depressed finite-size correction 
in the thermodynamic limit. It is also worth noticing that relation ([37]) for h = k = 1 
reads 



(S-l)(H-l) = q(l- q -")(l- q - M )^ q , 

that is, the average of the product between the numbers of non-trivial solutions and of 
non-trivial hypcrloops, in the thermodynamic limit, tends to a constant, independent 
from the variable-to-clauses ratio 7 = N/M. This is the precise formalization of the 
heuristic solution-hyperloop correspondence of section 2.4. 



3.5. Expectation values for excited states degeneracy. 

In a way similar to the previous calculation, we could find the expectation value for 
expressions involving the number of excited states S^ n \ with n violated clauses, or by 
duality, the number of hypergraphs with n variables appearing in the associated 
equation. 

A quantity which has been considered important in the literature Q is H^ x \ as 
such a kind of hypergraph fixes the value of a variable, which will be part of the so 
called "backbone", defined as the subset of variables which take the same value on 
each solution. As one trivial solution is the ferromagnetic one, the size of the backbone 
is in some sense the magnetization of the system. 

The average probability that a certain variable configuration is a n-level excited 
state is exactly (^f)(<Z — l) n q~ M , as, after the change of basis which makes this 
configuration the first basis vector, it will be a n-level excited state if and only if 
we have exactly n non-zero entries in the first column of the grid: 

= W (q - 1)" (V w - M ) - q- M ) ; (38) 

= Q ( q - l)» (q( M ~") - q- N } . (39) 



In particular the expression for shows analytically the conjecture, supported 

by numerical simulations on traditional hyper-SAT [p|, that, by varying the density 
of clauses, level 1 hypergraphs arise at the same threshold of hyperloops, but in a 
discontinuous way. 
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4. Derivation of Pn,m{R) from integer moments of S 

In this section we will calculate the exact probability distribution pn,m{R) for the 
rank of M x N matrices on GF(q), first by calculating the expectation value of integer 
moments of the number of solutions for the related homogeneous system Jx = 0, then, 
solving the "moments problem" (see [p6[) deriving from them. 

The order parameter for the SAT/UNSAT phase transition is the probability that 
an instance has no other solution beyond the imposed one, that is, the probability that 
the rank R of the matrix J is equal to the number of variables N, so it is simply the 
value of Pn,m(N). 

4--1- Calculation of S k 

Equation (|34|), in the infinite size limit, reads 



H(S - tf) = q- kA ; (40) 

i<k 

where, as we said above, A = M — N. Given the linear system of the first fe equations, 
we could try to solve it, and find an expression for each moment S k . The linear system 
is: 

]TA fc/l s^<r fcA ; (4i) 

h 

where the entries of the matrix Akh & rc defined by 

fc-i fc 



Y[(S-q h )=J2A kh S h . 

h=0 h=0 

This product is analogous to the expression of theorem || in Appendix A, so we obtain 
U(S a h ) = (-l) fe ^ ff(l - Sa- h ) = J2(-D k - h S h q ^- h ^ 



h=0 h=0 h=0 

and, from the definition, we recognize: 



A kh = (-l) fc -\ 



(42) 



Note that Akh is a triangular matrix, as obvious, as in the z-th equation only moments 
up to iS 1 appears. This property is encoded in equation (|42|), as in lZk-h,k a zero factor 
appears when h > k. 

If we could find the inverse matrix Bik such that 

TO 

^r,B lk Akh = 6ih n>l,h, (43) 

fc=0 



we would solve the system. We have an ansatz for this matrix: 
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B, 



ik = q 



k(l-k) 



We should verify that 



E« 

k=h 



k{l-k) 



(-lf-\ 



fc(fc-l) h(h— 1) 



(44) 



(45) 



When I < h this is trivial, as we have no summands. When I = h it is still easy, as we 
have only one summand, which is trivially 1. When I > h we need a bit of algebra: 



E? fe 

k=h 



d-k) 



fc(fc-l) _ h(h-l) 



■i=0 



ft 



where i = k — h and j = I — h. We can recognize again the expression of theorem 
so the last quantity is 



i=0 



4-1 

= II( 1 -^ 1 9- n )= - 

n=0 



So, at the end we found the matrix which solves the linear system: 

k 



S k =B kh q- hA = J2l —Q 



-hA h(k-h) 



h=0 



4-2. Derivation of PA{g) 

If we write the expression for B^h as a polynomial calculated in q k , we have 

h-i 



Bh(x) = Yl 



i=0 



X — q 
qh _ qi 



B kh = B h (q k ) . 



As, by definition, S k = (q 9 ) k — ^2 a p{g){q k ) g ', we have 



E^(.9)(9 fe ) 9 -E^' lA ^(^ 

g h=o 



(46) 



(47) 



(48) 



(49) 



furthermore, as Bh(q k ) = for h > k, we can extend summation to infinity. In this 
form, we could make an ansatz of analytical extension of the property above, and 
write 

Ansatz 4.2 



(50) 
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If this property is formally true, and not only valid for the discrete set of points x = q k , 
we could find pa(s) just comparing the series coefficients. Again from theorem ^] in 



Appendix A, we have 



h-l 



h-l 



JJ(» - <f ) = {-lfq = J] (1 - ig"*) = ^(-l)' 1 - 



fc(fc-l) 9(9-1) 



i=0 



i=0 
h-l 



9=0 



n^ 1 



g" — 



so, calling I = h — g, we have 



PA (5) = 



ff(ff+A) 



"^(_ g -(s+A+l)V J_ 



with a further use of theorem || we finally have 

P a(<7)=<T 9(9+A) : 



Wo,, 



(51) 



(52) 



Wo, s 7?-0,g+A 

Note that, as asymptotically the moments diverge faster than exponentially (S k ~ 
C q( k ~^> / 4 ), the heuristic ansatz 12 has no mathematical justification (see f2(|). 

4-3. Finite-size corrections 

In this procedure it is not difficult to introduce finite-size corrections. In fact, if we 
go back to the exact form of equation (|34|), we could write 



k,N 



(53) 



i<k 



so the linear system has the same matrix Akh, but a different inhomogeneous term: 

AkhS* = q- kA K N -k,N ■ (54) 

All the procedure of inversion of Akh are still valid, and we just need to replace 
equations (|47], 49 ) with the correct 

k 

= $>A(<?)(g fe ) 9 = £ <T hA n N - h , N B h {q k ) . (55) 

9 h=0 

Now, beyond a finite-size cutoff k ^> 2N + A (as IZn-h.n = for h > N), moments 
diverge like S k ~ C q N ( k ~ N ~ A ) ; anc l the finite-size version of the ansatz 4.2 is actually 
a precise mathematical statement: 

Ansatz 4.3 



Epa(3)x 9 = J2 q- hA n N ^ h . N B h {x) . 



(56) 



h=0 
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This is a rare case in which a careful finite-size regularization demonstrates the validity 
of a heuristic analytic continuation. 

Now just some algebra follows: we can correct equation (153) in 



q g(.g+A) 9 _ (a+A+l \ sl KN-g-l,N 

PN,M(g) = -= 2 ( - 9 ) — -p 



-A) T^0,N+AT^N-g,N 



1=0 



7to,g'R-Q,g+A 

(57) 

Introducing the rank of the matrix R = N — g to symmetrize the notations: 

mM (rank(J) = fl) = g^-W^fto,* \ N ] Pfl . (58) 

Finite-size scaling function of the SAT/UNSAT order parameter 

Equation (|58| ) provides us all the informations about the behaviour of our model, 
at zero temperature with respect to the fictitious Hamiltonian (^|). If we want 
to compare our results with the standard "open questions" of random satisfiability 
problems, we should concentrate on the order parameter for the SAT/UNSAT 
transition, that is 

0( 7 , M) = pn.m (rank(t/) = N) , 
where it is understood N = jM. The results of equation (B8h leads to 



0(7, m) = n 0>N 



The thermodynamic limit M — > oo, at fixed density 7, allows to determine the critical 
density j c = 1: 



0(7,oo)= lim 

M->oo /•C ,(1- 7 )M 



1 


7 


< 1 




7 


= 1 





7 


> 1 



(59) 
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We expect a non trivial M — ► oo limit when we keep fixed a certain combination of 
density and size of the system, (7 — 7 c )M 1 / !y . Again from equation (58) it is clear 



that the proper scaling window is obtained for v = 1. Introducing the parameter 
A = (7 C — j)M, and performing the thermodynamic limit at fixed A, we obtain back 
the result implicit in equation (p2|) : 



0(A) = km ^ = ?N- (60) 

M->oo 7Co,A 7m>,A 

Although this result makes sense only for A an integer number, analytical extension 
to A an arbitrary complex number is possible if we use the property 

„ _ 7m},oo 
'Ml 71 — 



to restate equation (|6(]) as 

+00 



^(A)=^ A ,oo=n( i -^ A_fc )- ( 6i ) 



fc=i 

A plot of this function for a typical value of q, and A a real number, is shown in figure. 



5. Derivation of pn.m{R) from Gauss algorithm 



As we have seen in section 3.2, we can perform a gauge transformation which, on the 
space of configurations, is essentially a change of basis. We can perform a gauge fixing 
to make clear the rank of a instance matrix J, and, from the analysis of this procedure, 
rederive the probability distribution for the rank, pn,m(R)i which we already found 
in a purely algebraic way in equation ( |58| ) . 

We will define our gauge fix in an algorithmic way, which we call Gauss algorithm, 
as it is essentially the standard algorithm to triangulate a matrix. We remark that we 
should only admit "moves" on the columns, as they correspond to a change of basis 
in the space of configurations (J — ► J-B -1 , with B 6 GL(N)), while we only admit 
simple permutations on the rows (J — > SJ, with S G q-Perm(M)). 



5.1. Step zero 

Gauss algorithm takes a M x N matrix J, and reduce it in a "normal form' 



1 






• 




1 





















1 




^red 







(62) 



where the central triangular matrix has side R = N — g, corresponding to the rank of 
the matrix. 

Suppose we know the probability distribution pn,m(R)- Now consider the two 
following inductive steps: 
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5.2. Induction on columns (add one variable) 

Take a M x (N + 1) matrix J, and apply the Gauss algorithm neglecting the last 
column c. The new matrix will be in a "pre-normal form" 



J (1) 



1 






• 






l 

























i 




c 


Jrcd 









(63) 



(note that the probability distribution for the last column is flat). Now reduce to zero 
the first R terms of the column via Gauss algorithm, obtaining 



7(2) = 



1 






• 







l 


























i 






^rcd 
















d 



(64) 



(note that the probability distribution for the remaining part of the column is still 
flat!). Now we have two ways to get further: 

c' contains only zeroes (this happens with probability q~ M+R ): the matrix is 
already in normal form, as we have simply added an empty row, so R' — R; 

c' contains some non-zero element (this happens with probability 1 — q~ M+R ): 
we will bring the column on position R+l, and eventually permute two of the 
last M — R rows in order to have a 1 in the diagonal term, and again a matrix in 
normal form. At the end, we have R' = R + 1; 

The recursion law deriving from these reasonings is resumed in the diagram below: 
Pn,m( r - 1 ) Pn,m(R) 

Pn+\,m{R-1) Pn+\,m{R) Pn+i,m(R+1) 

The analysis of the diagram leads to the equation 

Pn+i,m{R) = (1 - q- M+R -*)pNMR - 1) + Q~ M+R Pn,m(R) ■ (65) 
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Take a (M + 1) X N matrix J, and apply the Gauss algorithm neglecting the last row 
r. The new grid will be in a "pre-normal form" 





l 

l 





• 








J (1) - 

w norm 


^rcd 


1 




(66) 




r 





We call r' the last N — R terms of the row r, in analogy to the previous section. Note 
that the probability distribution for these terms is again flat. Now we have two ways 
to get further: 

r' contains only zeroes (this happens with probability q~ N+R ): the matrix is 
already in normal form, with R 1 = R; 

r' contains some non-zero element (this happens with probability 1 — q~ +R ): 
we will bring the row on position R + 1, and apply Gauss Algorithm to the last 
N — R rows, in order to have just one 1 left in the first term (the diagonal one), 
and again a matrix in normal form. At the end, we have R' = R + 1; 

Again the recursion law deriving from these reasonings is resumed in a diagram: 



Pnm{R - 1) 

PN,M+l{R - 1) 



Pn,m(R) 

I 

PN,M+l(R) 



1-<T 



PN.M+l{R + 1) 



The analysis of this diagram leads to a second equation 



Pn,m+i(R) = (1 



)Pn,m( r - 1 )+1- N+R PnM r ) 



(67) 



5.4- Derivation of the probability distribution 

The two equations j65| , |67| ) are linear and homogeneous. The quantities involved 
depends on three independent parameters: N, M and R. We will show "a posteriori" 
that the quantity pn,m(R) has a good thermodynamic limit M — * oo when the limit 
is performed at A = M — N and g = N — R fixed. We will call this limit function 

pa (g). 

This assumption allows us to write equations (65 6?]) as a system of two linear 
equations for p/\{g), in the two variables g and A: 



PA-i(g) = (i 



-9-1 



)pA{g + i) + q- 9 PA{g) 



PA+i(g) = (l - q- g - 1+A )PA(g + 1) + q- g+A PA(g) 

An easy to verify solution is 

-s(s+A) 



pa (g) =po(0) 



q 



IZo.glZ, 



gl<-0.g+A 



(68) 



(69) 
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1, 



lim pa(0) 

A^+oo 



which leads to 



pa( 9 ) = g-^+A) 



TZ 



+00 



25 



(70) 



(71) 



which is again equation (p2h. 

This formula was previously (independently) discovered by the Russian school 
(see [|2l Oi ) , with a procedure analogous to the one of this subsection. 



5.5. Finite-size corrections 

In this paragraph we will show how, with a careful use of the theory of partitions (see 
^7[ , and, for a summary of the main result, Appendix A), the only equation ( |65| ) (or, 
in a similar way, the only equation (67)) leads to the exact expression of pn,m(R), 
calculated directly from the initial conditions, with no need of any assumption. 
Equation (B5j) was 



PN+hM (R) = (1 - q-v+^PNMR " 1) + <t m+r pnM*) ■ 
It is more suitable to switch to the variable S = N — R instead of N for some steps: 

PM (S, R) = (l- ff- M+fl-1 )PAf (S, R~l) + q- M+R p M {S — 1,R). (72) 

Imagine to expand recursively this equation until we reconduce pm(S, R) to the initial 
conditions 



Pm(0,0) = 1; 



p M (S,R)=0 if S<0,R<0. 



This would produce a prolification of terms. Each of these terms can be labeled by a 
trajectory on a S x R grid, starting from the point (0, 0) on the top left corner, and 
stopping at the point (5, R) on the bottom right corner. This trajectory is directed: 
we only admit steps on the right nearest neighbour (s, r) — > (s,r + 1) or to the down 
nearest neighbour (s, r) — > (s + l,r). The weights of these steps, determined by 
equation §T£), are 

(s,r) ^ (s,r+ 1) 



(s + l,r) 

An interesting fact about these weights is that they only depend on r. For this reason, 
the contribute deriving from the horizontal steps is the same in all the trajectories, 
being 



R-l 

IK* 

r=0 



-M+r 



) = 1Zm-r,m = 7io,R 



(73) 
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The contribution deriving from the vertical steps is more complicated, and needs 
some results from the theory of partitions. Consider the trajectory as a string 
of S integer numbers (rj.,... ,Ts), corresponding to the length of the rows of the 
diagram individuated by the trajectory. As the trajectory is directed, we should have 
< n < r l+1 < R. 




(S,R) 

The vertical steps cont ribution for a certain trajectory will be Y[i=i1~ M+ri ■ As a 
result of theorem |l| in Appendix A we could write the generating function in the 
number of rows: 



/(*)=z* s e n* 

S (ri,...,r s )»=l 



S R 



n - 

l K 1 - zq 



and from theorem |3| we have 

R 



n i _ za~ M+r ~ e zkq 



r=0 



(M-R) 



R + k 
R 



-M+r ' 



(74) 



and, as we are interested in the term z , for S — N — R, we finally have 

n -(N-R)(M-R) 



which, combined with the contribution deriving from the horizontal steps, formula 



3[), gives again the result of equation (58) 



6. Conclusions 



We introduced a variant of the hyper-SAT model, by relaxing the constraint on the 
length of the clauses. This makes the model exactly solvable, so that we could not only 
compute the probability for an instance of being satisfiable, but also the probability 
distribution for the number of configurations which satisfy all the constraints. This 
probability distribution can be evaluated also for finite number of variables N, and 
of clauses M. The SAT/UNSAT transition occurs, in the thermodynamic limit, in 
which N and M go to infinity, with their ratio fixed to one. This model enjoys a dual 
formulation, in which the role of clauses and variables is interchanged, in agreement 
with the previous remark that the phase transition occurs when the number of clauses 
is equal to the number of instances. The whole analysis is soon extended to the general 
GF(q) case. Finite-size scaling functions are easily derived from the general solutions. 
The reasons for our work were mainly three: 

(i) Random matrices on GF(q), with N > M, are connected to random linear codes 
of the Error Correcting Codes theory, in particular to the Random Codeword 
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Model introduced by A. Montanari in The unexpected existence for an 

exact solution of the zero-temperature thermodynamic, and some other exact 
results also for the finite-temperature opens new perspectives in the study of this 
model. 

(ii) The random satisfiability of sparse linear systems on GF(q) is a polynomial 
problem, of computation complexity N 3 . A careful average-case complexity 
analysis shows both a SAT/UNSAT transition (discontinuity in the order 
parameter "probability for an instance to have at least one solution" , as a function 
of the number of variables to number of clauses ratio), and a "dynamical" 
transition in the complexity class, from linear to cubic complexity (see fill ). 
These characteristics make the hyper-SAT model an ideal toy model to investigate 
the underlying mechanisms of average-case complexity in the A-SAT problem. 
Our model is a further idealization of hyper-SAT, where the dynamical transition 
is lost, which has the advantage of an exact analytical control, also of the replica 
approach. 

(iii) The random satisfiability of sparse linear systems on GF(2) can be restated as 
a combinatorial problem in Random Hypergraphs theory. Our results are thus 
relevant also in this area. 

The analysis of our model at finite temperature will be presented elsewhere . 
Appendix A. Results from theory of partitions. 

In this appendix we summarize some results from analytic number theory of partitions, 
mainly due to the ancient work of Euler and Gauss on the generating function 
formalism. Our main reference for these results is Chapter 12 of H. Rademaker's 
textbook j27|. 

It is worth noticing that the mathematical results of this appendix are 
the groundwork for important results of number theory known as Rogers-Schur- 
Ramanujan identities, which have been shown to be connected to the Virasoro 
characters of conformal minimal models (see e.g. the articles of McCoy et. at, 
|p8[ ^9)). They are also known as q-calculus, and are connected with one-dimensional 
asymmetric exclusion processes (ASEP) (see e.g. the work of T. Sasamoto, @). 

Given an integer n, a partition [p] n is a list of ordered integers (711,712, . . . , rik) 
such that their sum is n: 

\p]n = (ni,n 2 , ... ,n k ); 1 < n,- < n i+ v, ^ m = n; 

A common tool consists in introducing formal power series in which the n-th series 
coefficient is the sum, on all the partitions [p] n of the integer 71, of a certain weight 
W([p]n)> I n the more general framework, the weight function could be valued in C. 
In the case of "well behaved" positive real weights, we can think to the power series 
as a grandcanonical partition function on the set of partitions. 

An analytic expression for these power series could be found for a wide class of 
weight functions. We will restrict ourselves to weights which factorizes on the elements 
of the list, that is, given a certain function w : N — > C, the weight of a partition is: 

k 
i=l 
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We can state the first easy result as follows: 

Theorem 1 The generating function for the partitions, with a factorized weight 

W(\p]n) = Ui=l w ( n i)> is 



CO 

5>"5>(M„)=n— 

" [p]n i=1 



zw(i) 



(A.l) 



In the most common case, in which we just want to count all the partitions of a 
certain integer, the proper weight would be w(n) = q~ n , so we will be concerned with 
products of the kind Yii(^ ~ ZC T 1 )- It will be useful, as in equations (|32|), to introduce 
the notation 



j=m+l 



n—m.n 



(A.2) 



A more common notation, but more tedious to handle in our context, is the following: 
n-i 



(a;x) n = JJ(1 



i=0 



(A.3) 



where the quantity on the right is known as the (n, to) Gaussian polynomial in the 
variable x. In this paper we always used q^ 1 as argument of 7Z m .n products. A few 
easy relations between the Gaussian polynomials are: 



(A.4a) 
(A.4b) 
(A.4c) 



n 




n 


to 




n — m 



n 




n — 1 




~n- 1 


_ c -(n-m) 


~n-l 




71 — 1 






TO — 1 


+ q- m 






to — 1 


+ 




m 




TO 




to 



n 




m 




n 




n 


-V 


to 




I _ 




l_ 




TO 


- 1 



Furthermore, at fixed to, in the limit n — > oo we have 

1 



lim 



72-0,-1 



Now consider the formal power series 

n n 

P„(x, z) = ]^[(1 - zi') = ^2 a m ,n(x)z r - 

i=l 

from the relation 



m— 



we find that 



. 1 - zx™ +1 , . 

P n {x,XZ) = — Pn{x,Z) 

1 — ZX 



1 — x n ~ mJrl 

1 — X m 
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and from the initial condition ao n (x) = 1 we find 

/ 1 — x n ~ m '+* 



o m ,r,(a:) = Yl 



1 - x m ' 



from which we recognize 
Theorem 2 



n(i-*g-*) = X;(-*) m g- 



OO 



z— 1 m— 



-i) l 



7e, 



0,m 



Analogously, if we consider the formal power series 

n _^ x 
<9n(x, z) = TT = V b m>n (x)i 

1 — zx % * — ' 

i=0 

from the relation 



m=0 



wc find that 



1 — z 
Q n (x,xz) = - _ 



^m,n(x) — - — bjn~ l.n{x) 



l-X 

and from the initial condition &o,nOc) = 1 we find 

1 - x m ' +n 



b m , n {x) = tt — 



m + n 
m 



from which we recognize 
Theorem 3 



m + n 
m 



TT — — = Y 

i=0 * m=0 

oo 1 oc 

TT — - = y^ m — . 



m=0 
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(A.5b) 



(A.6a) 
(A.6b) 
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